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Prediction and control of network dynamics are grand-challenge problems in network science. The 
lack of understanding of fundamental laws driving the dynamics of networks is among the reasons 
why many practical problems of great signihcance remain unsolved for decades. Here we study the 
dynamics of networks evolving according to preferential attachment, known to approximate well the 
large-scale growth dynamics of a variety of real networks. We show that this dynamics is Hamilto¬ 
nian, thus casting the study of complex networks dynamics to the powerful canonical formalism, in 
which the time evolution of a dynamical system is described by Hamilton’s equations. We derive 
the explicit form of the Hamiltonian that governs network growth in preferential attachment. This 
Hamiltonian turns out to be nearly identical to graph energy in the configuration model, which 
shows that the ensemble of random graphs generated by preferential attachment is nearly identical 
to the ensemble of random graphs with scale-free degree distributions. In other words, preferential 
attachment generates nothing but random graphs with power-law degree distribution. The exten¬ 
sion of the developed canonical formalism for network analysis to richer geometric network models 
with non-degenerate groups of symmetries may eventually lead to a system of equations describing 
network dynamics at small scales. 
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I. INTRODUCTION 


Large real networks—social, biological, or 

technological—are complex dynamical systems dl¬ 
ls]. Understanding the dynamics of these systems is a 
key to better prediction and control of their behavior, 
and the behavior of the processes running on them, 
such as epidemic spreading nisi and cascading failure 
propagation El El- Network dynamics can be roughly 
split into two categories: large-scale and small-scale. 
Large-scale dynamics usually means network growth, 
for example the growth of the Internet over years. 
Small-scale dynamics refers to the dynamics of links in a 
given network at small time scales, for example real-time 
interactions among mobile phone users or genes in a 
cell. It is quite unlikely that the small-scale dynamics of 
different networks can be in any way similar, so it seems 
that natural options to study and predict this dynamics 
can only be purely phenomenological, including data 
mining, model building, and parameter fitting El 
with all their caveats [iniiii]. However if two dynamical 
systems behave differently, it does not mean that the 
laws that govern their dynamics are different—the 
simplest example would be the quite different dynamics 
of two and three gravitating bodies of similar masses 
in empty space. And indeed if considering network 
dynamics we move from the small to large scale, we 
observe that preferential attachment accurately 

describes the growth of many very different real net¬ 
works [THlHSj . This observation raises two questions: 
(1) can preferential attachment be formulated within the 


canonical approach in physics, and if so, then (2) does 
the same approach apply to network dynamics at small 
scales? 

Here we answer positively the first question by show¬ 
ing that preferential attachment can be fully described 
within the canonical formalism. That is, we show that 
the growth dynamics of networks evolving according to 
preferential attachment is Hamiltonian, and derive the 
explicit form of the Hamiltonian in the corresponding 
Hamilton’s equations. This Hamiltonian turns out to be 
nearly identical to the Hamiltonian in the soft configura¬ 
tion model. 

In the canonical formalism the dynamics of a sys¬ 
tem with canonical coordinates {q,p) and Hamiltonian 
which is usually the total energy of the sys¬ 
tem, is described by Hamilton’s equations 


q = 


m 

dp ’ 



( 1 ) 


The canonical approach has a long history of success 
in physics. All the fundamental interactions in nature 
are described by Euler-Lagrange or Hamilton’s equations 
with different symmetry groups m- The Einstein field 
equations in general relativity are Euler-Lagrange equa¬ 
tions for the gravitational Einstein-Hilbert action, while 
the ADM formalism is the corresponding Hamiltonian 
formulation |20] . Here we extend the canonical formal¬ 
ism to complex networks, and find the Hamiltonian de¬ 
scribing the dynamics of growing networks in preferential 
attachment. 

Most network models can be classified as either equilib¬ 
rium models, which are the ensembles of graphs of fixed 


2 


size, e.g. the Erdos-Renyi random graphs pTh23) . or non¬ 
equilibrium models, in which networks grow with time, 
e.g. the preferential attachment model [IMll- Equilib¬ 
rium models are usually more amendable for analytical 
treatment, while non-equilibrium models better mimic 
the growth dynamics of real networks. In the special 
case of uncorrelated random graphs, there exist grow¬ 
ing network models that produce equilibrium ensembles 
of graphs with an arbitrary degree distribution [24) . In 
general however, the two types of models, growing and 
equilibrium, are very different, and so are the ensembles 
of random graphs that they define. A great number of 
works have studied equilibrium ensembles. A statistical 
theory of equilibrium correlated random graphs was de¬ 
veloped in |25j . Review article [26] surveys important 
advances in the field of equilibrium network models. In 
particular, it discusses the structural properties of the 
graphs and topological phase transitions in equilibrium 
graph ensembles. An interesting “symbiotic” model, an 
equilibrium network model with fixed number of nodes 
and links which evolves using a local rewiring move, was 
studied in m- It was shown that if the graph Hamilto¬ 
nian is chosen appropriately, then the networks generated 
by the model are scale-free. In all the prior works how¬ 
ever, the graph Hamiltonians appear only in the equi¬ 
librium sense, i.e. as the graph energy proportional to 
the logarithm of the graph probability in the equilibrium 
ensemble. To the best of our knowledge no prior work 
has studied Hamiltonian dynamics of growing networks, 
where the graph Hamiltonian is the graph energy which 
defines the dynamics of growing graphs via Hamilton’s 
equations Q. 

Our starting point relies on recent results [2H] es¬ 
tablishing the special conditions under which there ex¬ 
ists strong equivalence or duality between equilibrium 
and non-equilibrium (growing) graph ensembles 
where is a set of graphs of size N, and P is a proba¬ 
bility distribution on Two network models or graph 
ensembles (^Ar,Pi) and (t/Ar,P 2 ) are equivalent if they 
generate any graph G G Gn with the same probability, 
i.e. Pi(G) = P 2 (G) for any G. We say that two ensembles 
are strongly equivalent, if they are equivalent for any N. 
Using these special conditions, we obtain an intermediate 
result, which is important in its own right. It gives an 
equilibrium formulation of preferential attachment. This 
formulation is useful because it allows, for the first time 
to the best of our knowledge, to explicitly calculate for 
any given graph G, e.g., a given real network, the prob¬ 
ability P(G) (X exp[—iJ(G)] with which preferential at¬ 
tachment generates this graph, where H{G) is the graph 
Hamiltonian (graph energy). This Hamiltonian turns out 
to be very similar to the Hamiltonian in the soft configu¬ 
ration model [2nHSI]- Based on this intermediate result, 
we then derive the dynamic Hamiltonian "H that governs 
the network evolution in preferential attachment. 

Remarkably, the static Hamiltonian H and its dynamic 
counterpart H turn out to be nearly identical. The only 
difference between the two is that exact node degrees 


in H are replaced by their expected values in "H. We 
thus prove that preferential attachment and configura¬ 
tion model are in fact the same ensemble, or in other 
words, that preferential attachment generates nothing 
but random graphs with a given power-law degree dis¬ 
tribution. One could in principle expect that to be true 
in view of several equilibrium(-like) approaches to pref¬ 
erential attachment [5^34) . In [32] it is shown how 
the hidden variable formalism introduced for equilibrium 
graph ensembles can be applied to the preferential at¬ 
tachment networks to derive their degree distribution and 
the correlation structure. In a similar spirit, |33j applies 
methods of statistical mechanics to compare equilibrium 
and non-equilibrium graph ensembles and, in particular, 
demonstrates that the degree distribution in preferential 
attachment is identical to that in the equilibrium ensem¬ 
ble of random trees. Recent work [34) . among other re¬ 
sults, proves that a minor modification of the preferential 
attachment model, called sequential preferential attach¬ 
ment, is identical to the equilibrium graph ensemble con¬ 
structed using several Polya urn processes. In this work 
we prove that the expectation that preferential attach¬ 
ment and configuration model are nearly equivalent is 
indeed correct. 


The flow of logic in the paper, and a more detailed 
summary of the results are as follows. We begin with a 
recollection of basic facts concerning exponential random 
graph models (ERGMs) [311111133135] (Section |H| and 
models of random graphs with hidden variables 13137] 
(Section |HI[ ) that we will need in subsequent sections. In 
particular, in latter models, an equilibrium random graph 
ensemble is fully defined by a distribution p[r) of hidden 
variables r attached to nodes, and connection probabil¬ 
ity p{r, r') between nodes, and two such ensembles are 
equivalent (Pi(G) = P 2 (G)), as soon their hidden vari¬ 
able distributions and connection probabilities are the 
same, pi(r) = P 2 ir) and pi(r, r') = p 2 (r,r'). Random 
graphs with hidden variables are not ERGs per se, but 
they are collections of ERGMs with fixed values of hidden 
variables playing the role of Lagrange multipliers. 


We then briefly discuss soft preferential attachment 
(SPA, Section IV). SPA is different from standard pref¬ 
erential attachment (PA) in only that in PA new links 
attach to existing node i with probability proportional 
to its degree k^, while in SPA new links attach to exist¬ 
ing node i with probability proportional to its expected 
degree Ki. The idea behind the SPA definition is to as¬ 
sign to each new node j a hidden variable rj ^ log j, and 
then connect j to existing node i with certain probability 
that depends only on the current values of j’s and i’s hid¬ 
den variables rj and r^. The words current values appear 
here because the values of these hidden variables r need 
to be updated as the network grows, and the combina¬ 
tion of this update rule and connection probability are 
such that new nodes do indeed connect to existing nodes 
with probability proportional to their current expected 
degrees, so that we do have SPA. The key point behind 
SPA is that it has a coupled dynamics of hidden vari- 
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ables ri and expected degrees Ki, both growing functions 
of the network size TV = 1, 2,... or “cosmological time” 
t ^ log TV. 

Next, in Section |Vj we show that SPA is an ERGM, 
which is asymptotically (TV 1) identical to the soft 
configuration model (SCM) for sparse graphs with aver¬ 
age degree «C TV. We do this in steps. We first recall, 
in Section [V A[ that the SCM is an ERGM with Hamil¬ 
tonian H{G) = '^^kin + C, where ki is the degree of 
node i in graph G, is the Lagrange multiplier fixing 
the expected values of ki in this ERG ensemble to some 
value (ki), and C are some additional terms. This ensem¬ 
ble is an equilibrium ensemble of random graphs with a 
given sequence of expected degrees {ki). If this sequence 
is power-law-distributed, then the sequence of r^s is ex¬ 
ponentially distributed. If this r-sequence is not fixed 
but sampled, for each graph, from a fixed exponential 
distribution p{r), then the resulting SCM, is an ERGM 
with hidden variables r of random graphs with a given 
expected power-law degree distribution. The next two 
most technical sections deal with certa in cosmetic 
justments to SCM (SCM+, Section VB) and SPA (SPA, 
Section VC), and in Section V D we show that after 


these adjustments, the two models (SCM+ and SPA) are 
asymptotically equivalent, and derive their equilibrium 
ERG Hamiltonians H in Section fV El 

Finally, in Section VI we turn back to the dynamics of 
SPA, and pose the question: is there a dynamic Hamil¬ 
tonian 'H{Ki,ri,t) such that the dynamics of expected 
degrees Ki (t) and hidden variables (t) in SPA is the so¬ 
lution of Hamilton’s equation with this Hamiltonian H? 
We answer this question positively by first derivin g the 


exact dynamics of KiS in SPA and SPA (Sections VIA 
and [yTHt , and then finding a whole family of Hamil 
tonians that provide a solution to the question above 
(Section VIC). This family is parameterized by arbitrary 
functions ^i (T), and we show in the same section that ^i (T) 
can be selected such that the resulting dynamic Hamilto¬ 
nian H in SPA, evaluated on the solution of Hamilton’s 
equation, is equal, for any value of graph size TV, to the 
equilibrium SCM’*' Hamiltonian H in Section VE upon 
substitution Ki(TV) = ki. 


II. EXPONENTIAL RANDOM GRAPHS 


The exponential random graph model (ERGM) [51 [551 
[35l [36] is one of the most popular and well-studied equi¬ 
librium network models, also known as the p* model in 
the social network research community [551 - ^0] . ERGM 
is a graph ensemble (^Ar,P), where is the set of all 
simple graphs (i.e. undirected graphs without self-loops 
or multi-edges) on TV nodes, and P is the probability dis¬ 
tribution on Qn that maximizes the Gibbs entropy 


subject to the constraints 


{xi) = x^, i = l,...,r. (3) 

The Xi in the above relation are certain graph properties 
(e.g. number of edges or number of triangles) often re¬ 
ferred to as the graph “observables”, Xi are the prescribed 
expected values of these observables in the model, and (•) 
denotes the expectation with respect to P. Intuitively, an 
ERGM is a “maximally random” ensemble of graphs with 
fixed values Xi for certain ensemble averages (xi). Math¬ 
ematically, the maximization of randomness corresponds 
to the maximization of the entropy (§. Constraining 
the expected rather than exact values of graph observ¬ 
ables relaxes the topological conditions on the network 
and makes the model amenable to analytical treatment. 

The constrained optimization problem (§ and (§ can 
be solved by the standard method of Lagrange multipli¬ 
ers, and it has the following explicit solution [29] 

e-H{G) 

P(G) = (4) 

where 

E (5) 

GeSiv 

is the partition function, i.e. the normalizing constant for 
distribution Q, and 


r 

H{G)^Y.^MG) ( 6 ) 

i-1 

is the graph Hamiltonian, i.e. the energy of microstate 
G in the equilibrium Boltzmann distribution Q. The 
parameters 6i are the Lagrange multipliers (“auxiliary 
fields”) coupled to observables Xi. They are determined 
by the following system of r equations 

dF 

^=S.. i = ( 7 ) 

where F = —\aZ \s the free energy. The ERGM dis¬ 
tribution Q is thus fully determined by the observables 
Xi and their expected values Xi. In [41], the ERGM is 
extended to exponential random simplicial complexes. 

As an example of an ERGM, which we will refer to 
in Section jV A[ consider the edge-independent random 
graph (EIRG) model. In this case, the graph observables 
are the graph edges: Xy = a^, where a = (aij) is the 
adjacency matrix of G € Qjv. The constrains (|^ are 
then 


{at])=Pij, i<j, (8) 

where 0 < pij < 1, and the Hamiltonian is 


S'(P) = - P(G) InP(G) ^ max, (2) 

GGSiv 


H{G) — ^ij ^ij ■ (9) 

i<j 
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Unlike many other examples, the partition function Z 
can be calculated exactly for the EIRG model [29] : 

Z = n(l + e”®^0- (10) 

i<j 


The relationship between the Lagrange multipliers 6ij 
and the model parameters pij follows from Q 

1 , , 
= 1 I ■ (11 

1 + e 

Knowing the partition function allows to find the corre¬ 
sponding ERGM distribution 0 

= (12) 

i<j 


This expression immediately suggests how to generate 
graphs from the maximum-entropy ensemble (0jv,P): 
connect every pair {i,j) of distinct nodes i ^ j, i,j = 
1,..., TV, independently at random with probability pij. 
We remark that (11) is nothing but the Fermi-Dirac 
distribution, where the Lagrange multiplier 6ij is inter¬ 
preted as the energy of the “single-particle” state 
Throughout the paper, we will often use the so-called 
classical limit for the Fermi-Dirac distribution, i.e. if the 
energy 9ij is large, then pij « . 


III. RANDOM GRAPHS WITH HIDDEN 
VARIABLES 

Random graphs with hidden variables [32l |37] are en¬ 
sembles of random graphs in which graphs are generated 
(or sampled) as follows. Each node i = 1,..., TV is first 
assigned a hidden variable ri, sampled from the probabil¬ 
ity distribution p{r), and then each pair of nodes (i,j) is 
connected with probability Pij = p{ri, rj). Since the hid¬ 
den random variables are independent, the probability 
P(G) of graph G in the ensemble (0Ar,P) is 

P(G) = [ P(G|r)p(r)dr 

TV (13) 

Y[p'ij' (1 - n ^(D)t^D, 

i>j i—1 

where {aij) is G’s adjacency matrix and r = (ri,..., r^). 
This equilibrium graph ensemble is thus fully defined by 
two functions: the hidden variable PDF p{r) and the 
connection probability function p{r, r'). 

In equilibrium statistical mechanics, two ensembles 
are equivalent if their state occupation probabilities are 
the same for any state. Similarly, two graph ensembles 
(I/Ar,Pi) and (G]v,P 2 ) are equivalent if they generate any 
graph G € Gn with the same probability: 



(14) 


It follows from Eq. (13) that two ensembles of random 
graphs with hidden variables are equivalent if their hid¬ 
den variable distributions and connection probabilities 
are the same: 


Pi{r) = p 2 ir), 
Pi{r,r') =p2ir,r'). 


(15) 

(16) 


Random graphs with hidden variables are closely re¬ 
lated to exponential random graphs. If we sample all 
hidden variables from p(r) just once, and then fix 
them, then the probability of graph G in the ensemble 
with these fixed r^s is given by (12), with pij = p{ri, Vj). 


Rewriting these p^s as 1/ (l -I- makes the ensemble 
manifestly identical to the EIRG ensemble with Lagrange 
multipliers 9ij. Therefore one can think of graphs with 
unfixed (sampled) hidden variables as a collection EIRGs 
with “randomized” Lagrange multipliers sampled from a 
fixed distribution. 


IV. SOFT PREFERENTIAL ATTACHMENT 


Our first goal is to represent the preferential attach¬ 
ment (PA) model as an ERGM. The original formulation 
of PA [12], where a new node connects to an existing 
node with probability proportional to its degree, is very 
intuitive, but not convenient for this purpose. Instead, 
we will use a hidden variable formulation of PA. It was 
shown in [42] that PA can be formulated as a hidden 
variable model, which generates growing networks up to 
some size TV with average degree k and power-law expo¬ 
nent 7 > 2, as follows. For each new node i = 1,..., TV: 

1. Assign to node i hidden variable 

ri = \ni. (17) 


2. Update the values of hidden variables of all existing 
nodes j < i hy setting 


rj{i) = I3rj + {1 — f3)ri, where 
1 

P = 


7 - 1 


(18) 

(19) 


3. Connect node i to each existing node j < i with 
probability 


Py = 

where 

Ri = ri- In 


1 _(_ grj{i)+ri-R, 

1 - 


( 20 ) 


_ and m= (21) 

to(1 -/3) 2 ^ ^ 


In large sparse networks, where is large and /3 < 1, 
the linking probability 

Pij « m(l — oc 



Pi(G) =P2(G). 


( 22 ) 









5 


In Section VIA we show that the expected degree of 
node j at time i is 


m(l - /3) f iY , ^{213 - I) 


(23) 


and therefore, the probability that i connects to j is ap¬ 
proximately a liner function of j’s expected degree 
We note that Eq. ( [2^ holds for /? < 1, i.e., 7 > 2. The 
corresponding relation for the limit /3 —>■ I (7 —>■ 2) is 
derived in Appendix | A.I| We refer to the hidden vari¬ 
able formulation of PA as the soft preferential attach¬ 
ment (SPA) model. Figure [^a) shows a doubly logarith¬ 
mic plot of the empirical degree distribution in a network 
generated by SPA along with the fitted power-law distri¬ 
bution. 

A conceptually similar formulation of preferential at¬ 
tachment as an equilibrium network model with hidden 
variables was first introduced in [32) . where the hid¬ 
den variable of node i is simply its injection time, 
Ti = i. For technical reasons that will become appar¬ 
ent in the next section, here we define = Ini, so that 

can be identified with the cosmological time of birth 
of node i [HI |33] . 


V. SPA AS AN ERGM 

Let {Gn,^spa) be the ensemble of graphs induced by 
SPA, where Pspa(G) is the probability that SPA gener¬ 
ates G e Qn- In this section we show that 

PsPA(G)cxe-^^'’^(^), (24) 

where the SPA Hamiltonian iLsPA is intimately related 
to the Hamiltonian in the soft configuration model. 


A. Soft Configuration Model 

The soft configuration model (SCM) is an 

ERGM, where graph observables are node degrees ki, 
i = I,..., N. The model has various equivalent formula¬ 
tions, and, in particular, SCM appears as a special degen¬ 
erate case of the equilibrium hyperbolic model [43] in a 
certain limiting parameter regime. This formulation also 
belongs to the wide class of network models with hidden 
variables. Specifically, the SCM formulation in [43] gen¬ 
erates equilibrium networks of size N with average degree 
k and power-law exponent 7 > 2 , as follows: 

I. For * = I,..., TV, assign to node i hidden variable 

r*-psCM(r)«ae“G-«scM)^ 0 < r < RgCM, (25) 

where a = f3~^ = 7 — I and 

N 


2. Connect nodes i and j, j ^ i, with probability 


1 

X _|_ g^Ti+Vj-RsCM ■ 


(27) 


A more familiar (but less convenient for our purposes) 
formulation of the SCM is obtained by the change of 
hidden variables h = /iq = ^(1 — /?)■ The 

hidden variable h has then the power-law distribution 
h ~ p{h) oc h~'^ and the connection probability is 
Pij = 1/(1 -f Nk/hihj) pFl fill H51. Comparing t^ con¬ 


nection probabilities in EIRG (11) and SCM (27), we 


readily obtain that the Lagrange multiplier 9ij in SCM 


IS 


% = Ti + Tj — i?sCM- (28) 


The SCM Hamiltonian is then 


Hsgm{G) = ^(ri -I- Tj — i?sCM)aij 

i<j i<j 

N (29) 

= ^ kiVi — MRscm, 

i=l 


where ki is the degree of node i and M is the total number 
of edges in the graph G. 

Figure [^a) shows the degree distribution in a network 
generated by SCM, which is identical to the degree dis¬ 
tribution in an SPA network generated with the same 
parameters. As expected, both are power-laws with ex¬ 
ponent 7 . Figure [^b) shows the empirical distributions 
of the hidden variables Pspa(?') and Pscm(?') in the gen¬ 
erated networks. Although both distributions are highly 
skewed to the right, there is a clear discrepancy between 
them. In the next section we fix this discrepancy by in¬ 
troducing a shifted SCM model, which is strongly equiv¬ 
alent to SCM, but has the same distribution of hidden 
variables as SPA. 


B. Shifted Soft Configuration Model 


While the hidden variables in SCM are random, in SPA 
they are deterministic. Nevertheless we can readily over¬ 
come this technical obstruction that hinders the compar¬ 
ison of hidden variable distributions in the two models. 

Let r*(A) denote one of the hidden variables 
ri{N),... ,ri\jiN) in SPA chosen uniformly at random 
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Hidden variable r 


FIG. 1: SPA and SCM networks. Panel (a) shows the em¬ 
pirical complementary cumulative degree distribution func¬ 
tions (CCDF) Pc{k) = ^*(^0 for two networks of size 

N = 10“^ generated by SPA and SCM with fc = 10 and 
7 = 3, and the corresponding power-law fit. As expected, 
Pc(fe) ~ k Panel (b) shows the empirical probability 

density functions (PDF) of the hidden variables in these 
two networks. 


at time ^ 1. The CDF of r^{N) is then 

|{z : ri{N) < r}| 


Fr,{N){r) =P(r,(A^) < r) = 


N 


|{i : /31nz -b (1 — ,5) InfV < r}| 

' N 

■ ■ — = A7Ei (30) 


N 


■ ^ S-CAlnN 
i<e P P 


1 / r-ln N 

.- / Id* = e ^ . 


When N is large, the SPA hidden variables 
ri{N),... ,rj^(N) can be viewed as being approxi¬ 
mately i.i.d. samples from This distribution 


has the following PDF 

Pspa(?') = ^Pr.{N){r) = , (31) 

which is structurally similar to the distribution of hidden 
variables in SCM 

/OsCM(r') « Q,g“(r-flscM) _ p + ^ where (32) 

cr = lnfc(l-/3)2. (33) 

It is readily verifiable that the approximate supports of 
PSP Air) and psc'M{r), i.e. segments that contain almost 
all probability mass of these distributions, are 


supp ppA = [(1 -/3)lnA^,lniV], (34) 

supp pscM = [(1 - /3) In - CT, In iV - cr]. (35) 


Therefore, it immediately follows from (31)-(35) that 
PSPA^r) is obtained from pscM{r) by translation by a. 
This motivates the shifted SCM model, denoted SCM+, 
which generates networks of size N with average degree 
k and power-law exponent 7 > 2 , as follows: 


1. For i = 1,..., Af, assign to node i hidden variable 
rf' by, first, sampling ^ Pscm(t')) and then shift¬ 
ing = Ti + a, where cr = In fc(l — /3)^. 

2. Connect nodes i and j, j ^ i, with probability 


Pij = 


I -|- g’'i' ^SCM + 


(36) 


where Rscm+ is given by (39). 

By construction, the distributions of hidden variables 
in SCM+ and SPA are identical 


PSCM+ (r) = PsCM(r -cr) = pspA{r)- (37) 

To make SCM+ equivalent to SCM, we need to choose 
i?scM+ appropriately. If two nodes have hidden variables 
r and r in SCM, then in SCM+ the values of these hid¬ 
den variables are r~^ = r + a and r~^ = f + a. The 
two models will be strongly equivalent, i.e. will gener¬ 
ate graphs G G Gn with equal probabilities Pscm(G) = 
PscM+ (G)) if fiia connection probabilities pscm(t', r) and 
Pscm+(^''’)’^''’) are the same. This leads to the following 
equation for it!sCM+ 

r + r- i?scM = r+ -b - i?sCM+ ■ (38) 

Therefore, 

i?gc^+ =lnAf-bcr = lnAfc(I-/3)2. (39) 

It is convenient to work with SPA and SCM+ (instead 
of SCM) since not only the degree distributions in the 
networks generated by these two models match, but also 
the distributions of hidden variables are the same. Our 
next step is to adjust SPA so that it becomes strongly 
equivalent to SCM+ (and, therefore, to SCM). 
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C. Bridging SPA and SCM+ 

Matching degree distributions is a necessary but, of 
course, not sufficient condition for model equivalence. In 
SPA, a link between nodes i and j < i may appear only 
at time i upon the birth of the younger node. We refer 
to such links — appearing at time i and connecting new 
node i to already existent nodes — as “external” links. 
We make the SPA model equivalent to SCM+ by also al¬ 
lowing “internal” links that appear at time i and connect 
old nodes a and &, where a,b < i. Namely, we define the 
model SPA that generates growing networks up to some 
size N with average degree k and power-law exponent 
7 > 2, as follows. For each new node i = 1,..., N: 


Let us now compute the expected degree of a new node i 
upon its birth in SPA. For large i, i?™* « ri-|-lnTOext(l — 
f3), and using the classical limit for the Fermi-Dirac dis¬ 
tribution, we get 


«i(i) =J2pI 


,ext 

U' 


dj 


j<i 


1 _|_ grj(i)+ri-Rf‘'' 

dj 


1 _|_ g/3T-j-|-(l-/3)ri-lnmBxt(l-/3) 

dj mext(l-/3) 


(45) 


1 _|_ 

rnext(l-/ 3 ) 




j '^dj 


—'^ext- 


1. Assign to node i hidden variable = Ini. 

2. Update the values of hidden variables of all existing 
nodes j < i by setting rj{i) = /3rj + {l — /3)ri, where 

/3=^- 

3. Connect node i to each existing node j < i with 
probability 


where 



= r, - In 


mext(l - 0) 


(40) 


(41) 


and TOext given by (46). 


4. Connect each pair of existing nodes a,b < i with 
probability 


Every new node thus establishes on average ruext links, 
and, as time goes, its degree may only increase. This 
means that Wext is the expected minimum degree in SPA, 
(^min)spA = iTT-ext, and therefore 


TOext = k{l - /3). 


(46) 


Another necessary condition (that helps to determine 
TOint) for the equivalence between SPA and SCM+ is that 
the expected average degrees in both models must be the 
same. That is, if in SCM"*" the expected average degree 
(fc)gCM+ equals k, then we must have 




(47) 


Let denote the expected number of internal links 
generated at time i. Then, the expected total number of 
links generated at time i is TOext + Z)"*, and the expected 
average degree in the network is given by 


Pab (^) ^ : 

(42) _ 2 

(^)sTa ~ 

(48) 

= In TOint (1 - /3), 

For large i, 

(43) 



where 


and TOint given by ( [50| . 

In Step 4, we scan all pairs of existing nodes and attempt 
to connect even those nodes which are already connected. 
The SPA model thus allows multi-edges. In large sparse 
{k N) networks, however, the proportion of multi¬ 
edges is small. For example, the expected ratio of multi¬ 
edges in SPA networks of size N = 10^, 10^, and 10^ with 
fc = 10 and 7 = 2.5 is, respectively, 7%, 4% and 2%. We 
can therefore ignore the multi-edge effect. The choices for 




a<ib<a 


dbda 


/o JO 1-k 
dbda 


0 JO 


X _|_ gl3ra+f3rb+2{l-P)ri-lnmi„til-l3) 
dbda 


TOext and TOint (Eqs. (46) and (50)) are explained below. 

First, a necessary (but not sufficient) condition for the 
equivalence of two models is that the expected minimum 
degrees in the two models must be the same. The ex¬ 
pected minimum degree in large networks generated by 
SCM (and therefore by SCM+) is fc(l — /3) [43]. Thus, 
we have the following condition 


10 1 -k 

,Wint(l - /3) 
' j2(l-/3) 

mint 

''2{i-py 


a/3b/3i2(i-/3) 


(49) 


"lint (1-/3) 

a~db~d dbda 


0 JO 


Combining (46)-(|49[), we get TOint 

iHint = fc(l - /3)(2,0 - 1), 


(50) 


(^min)cpA — ^(1 /5). 


(44) 


which is positive if /? € ( 1 / 2 , 1 ), or, equivalently, 7 € 
(2,3). We note that 2 < 7 < 3 is exactly the range of 




















power-law exponents empirically observed in most real 
networks [4^. Figure]^ shows the perfect match between 
the distributions of node degrees and hidden variables in 
SPA and SCM+ networks with 7 = 2.5. Recall that the 
match between the hidden variable distributions is the 


first condition (15) for two ensembles of random graphs 


with hidden variables to be equivalent. 

We note that as /3 —>■ 1/2, or, equivalently, 7 —t 3, SPA 
becomes manifestly identical to SPA. Indeed, in this case. 


TOext -t w = - and mint -t 0, (51) 


which means 




and 


p'abii) 0. Vz. (52) 
1 (7 —>■ 2 ) is analyzed in 


The other limiting case /3 
Appendix | A.l| 

It is important to realize that by choosing mext and 
mint according to (46) and (1^ we only satisfied two 


necessary conditions, but we did not actually prove that 
SPA is equivalent to SCM+. We prove this in the next 
section. 


D. SPA and SCM^ are strongly equivalent 

Since the distributions of hidden variables in SPA and 
SCM+ are the same, i.e. = Pscm+ (''')> to prove 

the strong equivalence between the two models, we need 
to show that the connection probabilities in SPA and 
SCM+ are also the same. More precisely, if at time N 
the values of hidden variables of nodes j and z > / in SPA 
are r and r > r, then the probability that these two nodes 
are connected must be equal to the connection 

probability PsCM+ (’"> of nodes with hidden variables 
r and r in SCM+. In what follows, we compute these 
probabilities in large sparse graphs (iV 1, fc ^ iV) and 
show that they indeed coincide. Throughout this section 
we assume that f3 € (1/2,1) (7 G (2,3)). 

Let P^p^(b/) denote the probability that nodes z and 

j are not connected in SPA, then 



FIG. 2: SPA and SCM^ networks. Panel (a) shows the 
empirical CCDF Pc{k) = X )) c '> fe ^(^0 for ^"'0 networks of 
size N = lO'^ generated by SPA and SCM'^ with fe = 10 and 
7 = 2.5, and the corresponding power-law fit. As expected, 
for both networks Pc{k) ~ k Panel (b) shows the perfect 

match between the PDFs of the hidden variables in these 
two networks, illustrating the theoretical result Pg^ir) = 
PscM+ (r)- 


Since r and r are the values of hidden variables of nodes 
J and z at time N, we have 


Ps?A(bj) = l-PipA{bj) 

= 1 - (1 - p-‘)(l - (z + !))...(!- pjf (iV)) 


N 


rN 


p]^\s)ds. 


Let us compute the integral first 


pN pN 

^ p^\s)ds = J^ 


ds 


rN 


1 _|_ e^i{s)+rj{s)-R‘‘ 

ds 


(53) 


(54) 


\ _|_ ^Pri+/3rj+2(1-13) In s-ln mi„t (1-/3) ' 


r =rj (N) = Ptj + (1 - ^) In A, (55) 

r =ri{N) = jdvi + {1 — j3) In A. (56) 

Finding from these equations and Vj and substituting 
them into (54), we obtain 


j-N pN 

I. 7 


ds 


\2(l-/3) ' 


(57) 


miiit(l-/3) \N J 


To proceed with analytic approximation, we need to use 
the classical limit for the Fermi-Dirac distribution, i.e. to 
drop the term 1 in the denominator. We have already 
used this approximation in (45) and (49). In those cases. 
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the second terms in the denominator were fully deter¬ 
ministic, and it was readily verifiable that they are much 
larger than 1, and therefore the approximations hold. In 
(57) however, both r and r are random, r, r ~ Pscm+j 
and a certain caution is required. 

In what follows, we show that the expected value of e 
in SCM+ scales as N, and, therefore, 

TV 1. Indeed, 


OC A^2/3 > 


fRsct. 

{el = / 

^0 


PscM{x)dx 


r^scM 


e e dx 




P Jo 

HI-Pi 

^ -^SCM / 

/3e do 

(l + /3)e^ 

\ p \ p 


(58) 


^g(l+3)^JsCM _ 

OC N. 


Thus, 


__ (1 — / 3 ) m.int _ f jy 2 / 3 -l _ „' 2 / 3 - l \ 

~{2p-l)e^+^N-^(^-d)^ > 

k{l-PlN fc(l - /3)2fV2(l-/3)i2/3-l 


(59) 


oT+f 


pT+r 


The probability of the external link is 


pT = 


1 


1 


X _|_ g / 3 r - j +( l -/ 3 ) ri - lnmext ( l -/ 3 ) 


(60) 


1 


k{l-PlN— 


r I 

e ^ /3 




where the last approximation holds because, using (58), 


o’’+ 


N (X N. Combining (53), (59) and (60), 


we obtain that the probability that nodes i and j in SPA 
are connected is 

PspaP’-'I 


fc(l - /3)2fV2(l-/3)j2/3-l fc(l_^)2ivV 


oT+r 


g’'+ — 


1-/3 . 


■ ( 61 ) 


To compare this probability with the connection prob¬ 
ability in SCM+, we need to rewrite it fully in terms of 
hidden variables r and r. From (56), = ^ — 1^ IniV. 

Therefore, 


i = ef 


- In A 


= ef> N P . 


(62) 


Substituting this expression into (61), we obtain after 


some algebra that the last two terms cancel out, and 




(63) 


It remains to show that (63) is, in fact, the connection 
probability in SCM+. Indeed, 

I 


PsCM+{r,r) = 


1 -)- g’’+’" ^SCM+ 

1 


X _ j _ gr - t - r—In Afc(l —/ 3)2 

I H^-PlN 


(64) 


1 


pT+r 


Afe(l-/ 3)2 


where the last approximation holds because ^ 

In Appendix ] A. 2[ we show the high accuracy of this ap¬ 
proximation with simulation. Fig. [^a ) juxtaposes the 
empirical connection probabilities in SPA and SCM’*' net¬ 
works, and the corresponding approximation in (64). Re¬ 
call that the match between the connection probabilities 
is the second condition for two ensembles of random 
graphs with hidden variables to be equivalent. 


Thus, we proved that, for large networks, SPA is equiv¬ 
alent to SCM+ in the strong sense. That is, for any 
N :$> 1, SPA and SCM+ generate graphs G G Gn with 
the same probability, Pgp^(G) = PgcM+(G'). Therefore, 
the expected values of all graph properties in the ensem¬ 
ble, not only of the degree distributions in Fig.j^a), are 
the same. As an example. Fig. [^b) shows the vertex-to- 
vertex distance distribution d{l), which is the distribution 
of hop lengths I of shortest paths between nodes in the 
network, or the probability that a random pair of nodes 
are at the distance of I hops from each other. 


The following diagram summarizes the relationships 
between the four considered network models 


SPA -w SPA « SCM+ A SCM, (65) 


S ^ 

where = denotes the strong model equivalence, « is an 
approximate strong equivalence that becomes exact in 
the sparse graph limit {N ^ oo, k N), and de¬ 
notes the model transformation allowing internal links. 
Furthermore, 


SPA 


P = l /2 


= SPA 


P = l /2 


( 66 ) 


that is, SPA and SPA are strongly equivalent if /3 = 1/2, 
i.e., 7 = 3. 
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4 6 8 10 12 14 16 18 

r + r 



distance 


FIG. 3: Connection probabilities and vertex-to-vertex 
distance distribution in SPA and SCM^ networks. 
Panel (a) shows the empirical connection probabilities in net¬ 
works of size N = 10^ generated by SPA and SCM^ with 
fc = 10 and 7 = 2.5. The y-axis in the main plot is in logarith¬ 
mic scale. As expected, the connection probabilities match 
remarkably well. The plot also shows the corresponding ap¬ 
proximation given in (641. As expected, this approximation 
holds very well for sufficiently large values of r + f, which cor¬ 
respond to the vast majority of node pairs, cf. Fig. ib) and 
Appendix A. 2 Panel (b) shows the distance distributions 
in the same networks. The j/-axis in the inset is in logarith¬ 
mic scale. As a consequence of the equivalence of SPA and 
SCM"*", the distance distributions also match remarkably well, 
as expected. In all cases the results were averaged across 100 
networks. 


E. Hamiltonians of SCM+, SPA, and SPA 


As discussed in 
with Hamiltonian 


Section V A[ SCM is the ERGM model 


— SCM+ must have the same Hamiltonian. Indeed, 

N 

-ffscM+ = ^ kivf — 

+ (68) 

^ n AT \ ^ 


Further, since SPA is strongly equivalent to SCM+ and 
the distributions of hidden variables in thesejiwo models 
are the same, the ERGM Hamiltonian of SPA is 


N 

-^SPA “ 

i=l 

N 

= ^ kir^ 

i^l 

N 

= ^ fciG 
2 = 1 


-^-Rscm+ 


(In A + a) 
2 




2=1 


In {Nkjl - pf) ^ 

— 2 - 

i-l 


(69) 


Finally, since SPA becomes manifestly identical to SPA 
as /3 —>■ 1/2, or, equivalently, as 7 —>■ 3, we can write the 
ERGM Hamiltonian for SPA 


N 

Hspa = Y - 
2=1 


In (^Nk/4Tj 
2 




i-l 


(70) 


This result means that if /3 = 1/2 (7 = 3), then, in the 
sparse graph limit, SPA is exactly ERGM with Hamil¬ 
tonian ( [70| . We note that this case corresponds to the 
original Barabasi-Albert model with the scaling expo¬ 
nent 7 = 3 [12]. If /3 7 ^ 1/2, then (70l is an approximate 
Hamiltonian of SPA. To the best of our knowledge, this is 
the first result where the preferential attachment model is 
represented as an exponential random graph model with 
explicitly written Hamiltonian. Even more remarkably, 
as we show in the next section, a Hamiltonian that is 
very similar to the ERGM Hamiltonian ( |70| ) describes 
the Hamiltonian dynamics of growing networks in SPA. 


VI. HAMILTONIAN DYNAMICS OF SPA 


N 


Hscm = ~ MRscm 


z=l 

N 


= Y ~ 


(In — a) 


N 


2=1 


(67) 


Y^-- 


Since SGM+ is strongly equivalent to SCM — the two 
models differ only by parametrization of hidden variables 


The key idea in deriving the ERGM Hamiltonian of 
SPA (701 was to construct a modified model SPA that: a) 
is strongly equivalent to a graph ensemble with a known 
Hamiltonian; and b) coincides with SPA under certain 
values of the model parameters. Here we adopt a similar 
strategy: we study the Hamiltonian dynamics of grow¬ 
ing SPA-networks, and the corresponding results for the 
Hamiltonian dynamics of SPA are obtained as a special 
case with /3 = 1/2. 
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The ERGM Hamiltonian of SPA (691 suggests that the 
canonical coordinates {qi,Pi} of a growing network are 
the node degrees ki and hidden variables An immedi¬ 
ate technical problem we face, however, is that both node 
degrees ki and network time i are discrete. We overcome 
this obstruction as follows. First, inspired by the map¬ 
ping between the hyperbolic and de Sitter spaces in [47] , 
we define 


t = l3\ni, (71) 

and treat t as a continuous time. The geometric duality 
between de Sitter spacetime, which is asymptotically the 
spacetime of our accelerating universe, and hyperbolic 
space, which is a latent space underlying real complex 
networks |421143] allows to interpret t as the rescaled cos¬ 
mological time. Second, instead of the exact (discrete) 
degree ki{t) of node i, born at time ti = /3Ini, at a later 
time t > ti, we use its expected degree Ki(t), which is a 
continuous function of t, for t > ti. 

Our next goal is to derive the time evolution of the 
canonical coordinates {Ki{t),ri{t)} in the growing SPA- 
and SPA-networks. Given that in network time i, rj(i) = 
(3rj -I- (1 — /3)ri, in the rescaled cosmological time the 
evolution of the hidden variable of node i — in both 
SPA and SPA — is 

u{t) = ti + ^-^t, (72) 

where U = /3Ini is the birth time of node i. The ex¬ 
pected node degrees, however, evolve differently in SPA 
and SPA, as we show below. 


A. Evolution of the expected node degree in SPA 

The expected degree of node j at network time i > j 


IS 


=^P3S + ^Ps3 
S<j S>j 

ds 


ds 


(73) 


1 _|_ gr,{i)+rj-Rj 


\ _|_ grj(s)+r„-Rs 


The first term is the expected degree of node j upon its 
birth, and the integral is calculated in the same way as 
the integral in (45), and it equals to m. The second term 
is 


ds 


ds 


1 _|_ grj{s)+r„-Rs 


1-k 


jP gi. — p 

m(l —/3) 




- 1 


(74) 


and, therefore. 


,(*) = 


/3 




m{2(3 — 1) 

^ ' 


This expression coincides exactly with the expression ob¬ 
tained in [14] for the expected degree of a node in the 
sharp preferential attachment model, where the link at¬ 
traction probability is a liner function of the node’s exact 
degree (versus expected degree). 

Finally, in the rescaled cosmological time. 


K,{t) 


mjl-P) m{2/3 - 1) 

a /3 


(76) 


B. Evolution of the expected node degree in SPA 


Similarly, the expected degree of node j at network 
time i > j is 

V (•) = E pj? +E p" +E E * (») ■ (77) 

s<j s>j s>j a<s 

0-^3 


The first two terms are exactly the same as in SPA (73) 
with m replaced by Wext- The last term is the expected 
contribution to the j’s degree from internal links, 





Combining all terms together, we obtain 


Kj(i) = TOext , (79) 

and, in the rescaled cosmological time, 

Ki{t) = mexte*”*' = k{l - I3)e^~*\ (80) 

If /3 = 1/2, then, as expected, the expressions for the 
expected node degrees in SPA and SPA become identical. 


C. Dynamic Hamiltonians 


We now have all the ingredients necessary to derive the 
Hamiltonian describing the dynamics of network growth 
in SPA. We want to find Hamiltonian T-L such that its 


Hamilton’s equations have (72) and (80) as the solutions. 


Let 'Hi{Ki, ri,t) be the energy contribution of node i at 
time t to the total network Hamiltonian 


N 


n = y^ni{Ki,ri,t). 


(75) 


(81) 
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Then Hamilton’s equations for node i are 

&H 


dn 


and h = -^. (82) 

Ut^i 


Formally integrating these equations, and noting that 

1-/3 


Hi — 


and 


Ti = 


/3 


(83) 


we can write the solution in the following form 

1 - 


Ur^Ki.ri.t) = mexte 


'•Ti - 


/? 


K*+?i(0: (84) 


where ^i{t) is some function of time t and model param¬ 
eters k and /3. 


Since ^i{t) does not affect the equations of motion (82), 
in principle, it can be chosen arbitrary. Remarkably, ^i{t) 
can be chosen in such way that it is the same for all nodes, 
and the resulting total Hamiltonian H is identical to the 
ERGM Hamiltonian with node degrees replaced by 
their expected values. Indeed, let 


k (2(1-13) t 

Now, consider a network snapshot at time 
T = /31n/V> 1, 


(85) 


( 86 ) 


with given (current) values of Ki and r^. Using ( [80| , (85) 
and ( 861 , we can rewrite the Hamiltonian of node i as 
follows 

H. = - k) - (87) 

The total Hamiltonian of the snapshot is then 


N 


N 






Since obtain that 


N 


"H = ^ KiU - 


(In + fj) 


N 

E 




N 

=E' 


N 


\n{Nk{l-l3f) ^ 

2 E' 


( 88 ) 


(89) 


which is exactly the expected ERGM Hamiltonian (691. 


Note that since the ERGM Hamiltonian can be inter¬ 
preted as the energy of a given network snapshot at some 
time t, the dynamic Hamiltonian with ^i(t) in (851 yields 
indeed the expected energy of the snapshot. 

To obtain the dynamic SPA Hamiltonian, all we need 
to do is to set (3 = 1/2 in the above derivations. The 
energy contribution of node i is then 


T~Li = xc* — Ki + k \ 1—bn^—t 


(90) 


and the total Hamiltonian that describes the dynamics 
of growing networks in SPA is 


2=1 2=1 


(91) 


As expected, this Hamiltonian is exactly the ERGM 
Hamiltonian of SPA (1^ with the node degrees replaced 


by their expected values. 


VII. CONCLUSION 

We have studied the dynamics of networks growing 
according to preferential attachment, and obtained two 
important results. First, we have shown that soft pref¬ 
erential attachment can be casted as an equilibrium ex¬ 
ponential random graph model, nearly identical to the 
soft configuration model. In other words, the ensemble 
of random graphs that preferential attachment generates 
is nearly identical to the equilibrium ensemble of ran¬ 
dom graphs with power-law degree distributions, mean¬ 
ing that preferential attachment and configuration model 
generate any graph G of any size N with the same prob¬ 
ability P(G). In general, this result is important because 
equilibrium network models tend to be more amenable 
for analytic treatment. In particular, this result, for the 
first time to the best of our knowledge, provides an ex¬ 
plicit expression P(G) oc exp[—i7(G)] with Hamiltonian 
H(G) (70) for the probability P(G) that preferential at¬ 
tachment generates any given network G. The knowl¬ 
edge of P(G) can be used, for example, for answering the 
question of how likely it is that a given real network has 
been grown according to preferential attachment. This 
question can now be answered by standard techniques, 
such as comparing the probabilities P(G) of the typi¬ 
cal preferential attachment networks and the real net¬ 
work under study. Another application is an alternative 
simpler method to generate preferential attachment net¬ 
works, which has already been implemented and publicly 
released as a part of a more general software package that 
generates random hyperbolic graphs |48) . 

Second, we have demonstrated that the growing dy¬ 
namics of preferential attachment networks is Hamilto¬ 
nian. Remarkably, the Hamiltonian T-L (89) that de¬ 
fines the equations of motion ( 72|80 ) describing network 
dynamics is nearly identical to the ERG Hamiltonian 
H (69). The only difference between the two is that the 


exact node degrees in H are replaced by their expected 
values in H. 

These results may appear quite surprising at the first 
glance, but there is an intuitive explanation. On the one 
hand, the equilibrium Hamiltonian H (G) in the soft con¬ 
figuration model is the energy of graph G in the Boltz¬ 
mann distribution P(G) of this exponential random graph 
model. This energy is the sum of energies of all edges 
in graph G, and one can check that the energy of edge 
{ij} is simply the sum of i's and j’s Lagrange multipliers 






















13 


Ti + rj. On the other hand, as shown in Section]^ soft 
preferential attachment, at each time t, is also a similar 
exponential random graph model, with hidden variables 
ri playing the role of Lagrange multipliers, and the en¬ 
ergy of edge {ij} at time t is also ri{t) + rj{t). 

In simpler terms, the reason behind this equivalence is 
quite physical: both the dynamic Hamiltonian in prefer¬ 
ential attachment and the equilibrium Hamiltonian in the 
configuration model are system energies, albeit the estab¬ 
lished equivalence between the growing and equilibrium 
representations of the same system is slightly atypical in 
physics [ 2 H] . 


Very few real networks can be adequately modeled as 
random graphs in the configuration model, which sug¬ 
gests that some additional terms must be added to the 
Hamiltonian to adequately describe the dynamics of dif¬ 
ferent real networks.In this context, it is an interesting 
observation that the ERG ensemble that we found to be 
equivalent to preferential attachment is a degenerate case 
of the more general geometric network ensembles, which 
can be considered as a Fermi gas in a hyperbolic space, 
whose symmetry group is the Lorentz group |42ll43j . This 
observation calls for extending the developed canonical 
formalism for network analysis to this more general geo¬ 
metric case with non-degenerate symmetries. This exten¬ 
sion is a highly non-trivial task for a number of technical 
reasons, but if successful, it may shed some light on the 
second question we raised in the introduction, concerning 
small-scale dynamics of networks. 

We have shown that preferential attachment can be 
formulated within the canonical formalism, in which the 
time evolution of a system is described by Hamilton’s 
equations q = dH/dp and p = —dH/dq. The traditional 
application of the Hamiltonian formalism in mathemat¬ 
ical physics [15] deals with the following direct prob¬ 
lem: given a Hamiltonian T-L, which in most cases is 
the energy of the system, find the solution of the cor¬ 
responding dynamical equations of motion. However, in 
physics history, the problem has almost always been in¬ 
verse: first, chronologically, the equations of motion are 
found by some other, usually experimental methods, and 
only much later it is recognized by theoreticians that 
these equations are solutions of some Hamiltonian or La- 
grangian systems defined by their symmetry groups. This 
was the case in most physics theories, from classical me¬ 
chanics [ini to general relativity [101 . Our understanding 
of network dynamics seems to have been driven along a 
similar historic path. First preferential attachment was 
suggested as a likely mechanism responsible for the emer¬ 
gence of scale-free degree distributions UMll , experi¬ 
mentally validated for many real networks [HHIH]- And 
only fifteen years later have we recognized that the prefer¬ 
ential attachment dynamics (72|80l is Hamiltonian (89). 


We emphasize however that these results hold only for 
the soft versions of preferential attachment and conhg- 
uration model. The difference between the soft conhg- 
uration model with a fixed expected scale-free degree 
sequence and the configuration model in which the ex¬ 


pected degree sequence is sampled for each graph from 
a fixed scale-free distribution has been recently quanti¬ 
fied in m- This difference is well-behaved, in the sense 
that the entropy distribution in the latter ensemble is 
self-averaging, meaning that its relative variance vanishes 
in the thermodynamic limit. However, it is known that 
the soft (canonical) and sharp (microcanonical) configu¬ 
ration models are different even in the thermodynamic 
limit—the ensemble distributions do not converge in the 
limit laiEi]. To the best of our knowledge, there are no 
results of this sort concerning the difference between the 
soft and sharp versions of preferential attachment, but 
one could expect them to be different as well. Therefore 
the existence of any connections between sharp configu¬ 
ration model and sharp preferential attachment, and the 
possibility to formulate the latter within the canonical 
approach, remain to be open questions. 
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Appendix 

A.l. SPA, S^, and SCM+ as d 1 (7 2) 

All three models — SPA, SPA, and SCM+ — have 
singularities at /3 = 1. In this Appendix, we investigate 
to what models they degenerate in the limit /3 —)■ 1 . 

The SPA model has a well-defined limit. Indeed, since 

—>■ Ti — In — and (A.92) 

m 

T’b ^ ^ _j_ ^rj+lnri—lnm ^ , j In i ’ (A.93) 

as /3 —>■ 1, SPA converges to the following simple model. 
To generate a network of size N with average degree k 
and power-law exponent 7 = 2 , for each new node i = 
1 ,..., A, connect node i to each existing node j < i with 
probability 

= —(A.94) 

' m 

In what follows, we show that the average degree in 
large networks generated by this limiting model is indeed 
k. At time N, the expected degree of node i is 

N 

(A-95) 

j<i j>i 
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The first sum is the expected contribution to the i’s de¬ 
gree from older nodes, 


j<i 


m 

Inz 


^E- = 

1 ^^ 7 

i=i 


m 

Ini 


H, 


m, 


(A.96) 


where Hi = X] 7 =i j ~ Ini is the i*^ harmonic number. 
The second sum in (A.95) is the expected contribution 
to the degree of node i from younger nodes, 


j>i * * 


where h(a;) is the logarithmic integral function. There¬ 
fore, 



/tAAT) =m-F — (li(fV)-li(i)). (A.98) 

i 

The expected average degree in the network at time N 
is then 

(A.99) 

^ ^ mli{N)HN m li(i) 

N N ^ ~r' 

i<N 


Since li(A^) « and « InfV, the second term is 
approximately m. The last term can be approximated as 
follows 

riktf =li(JV)l„JV-JV. (A.IOO) 

^<N * -^0 * 


Therefore, we finally have 


m 


k{N) =2m — —(li(iV) In A^ — N) 

mli(Af)lniV - 

=6m -—- « Zm = k. 

N 


(A.lOl) 


Figure illustrates how this approximation becomes 
more accurate as the network size increases. 

The SPA model completely degenerates as /3 —>■ 1. 
Namely, —>■—00 and —>■ — 00 , and, therefore, 

PiT 0- This means that the limiting 

model generates networks with no links. Remarkably, 
even in the limit /3 —)■ 1, SPA remains strongly equiva¬ 
lent to SCM’*'. To prove this, we need to show that in 
this limit SCM+ also generates networks without links. 

The connection probability in SCM+ is 


P^J = 


1 

1 -|- ~ ^SCM + 


(A.102) 


where r+ = -|-In (fc(l -/S)^), ri ~ Pscm(?'), and 

i?SCM+ = la (7Vfc(l — /3)^). Since 


gi"f+r'^ -^SCM+ 


e 'e ^ 




(A.103) 


FIG. 4: Average degree in SPA networks with /3 = 1 

(7 = 2). For N = 10^,10^, and 10'*, the plot shows the 
fluctuation of the average degree in 100 independent networks 
generated by the limiting (/? = 1) SPA model with k = 10. 
As expected, the lager the network size N, the more accurate 
the approximation (A.lOl I. 


and the expected value 

rt^scM 


rrcscM 

(e""') = / pscuir) dr 

Jo 


rJ^scM 


grgr-flscMJ^ 


g^SCM _ q—RsC\ 


N 


2fc(l-/3)2’ 


we have that 

p^t +’'/“-^SCM+ 


N 


4k{i-i3y 


00 . 


(A.104) 


(A.105) 


This means that the connection probability in SCM+ 
converges to zero, pij —f 0, as /3 —>■ 1, and therefore, even 
in this degenerate regime SPA and SCM+ are strongly 
equivalent. 


A.2. Accuracy of the classical limit approximation 
for the Fermi-Dirac distribution in SCM*' 

In Section |VD[ we used the classical limit for the 
Fermi-Dirac distribution in the SCM+ model 

PsCM+{r,r) f^PsCM+{r,r) = ^ ■ (A.106) 

Here we show with simulations that this approximation is 
very accurate in large networks. As an example, we con¬ 
sider networks with = 10 and 7 = 2.5. First, we gen¬ 
erate N hidden variables r from distribution Pscm+(’’)j 
and then, for each of the (^) pairs of nodes with hidden 


N 
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variables r and r, we compute the relative error of the 
connection probability approximation (A. 1061 


bsCM +(a 0 - PSCM+(a r)\ 
PscM+{r,r) 


(A.107) 


FIG. 5: Accuracy of the classical limit approximation. 

For N = 10^, 10^, and 10"^, the percentage of node-pairs with 
the relative error larger than 5% (1%) is, respectively, 40.5% 
(100%), 1.8% (13.6%), and 0.08% (1%). 


Figure shows the relative errors sorted in the increas¬ 
ing order for network sizes N = 10^,10^, and 10^. As 
expected, the larger the network, the smaller the classi¬ 
cal limit approximation error. 
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